
Conversation

@slaren (Member) commented Sep 9, 2024

  • Avoid copy in llama_sample_dist
  • Remove lambdas from llama_sampler_chain
  • Reduce overhead in logit bias sampler when there are no biases
  • Include call to llama_sampler_accept in llama_sampler_sample

```diff
 gpt_params params;

-llama_batch batch;
+llama_batch batch = {};
```
@slaren (Member, Author) commented:

This also fixes a crash in the server when loading a model fails and llama_batch_free is called on an uninitialized batch.

@github-actions github-actions bot added android Issues specific to Android examples server labels Sep 9, 2024
@github-actions github-actions bot added the testing Everything test related label Sep 9, 2024
@slaren slaren merged commit 5fb5e24 into master Sep 9, 2024
40 checks passed
@slaren slaren deleted the sl/sampling-re-2 branch September 9, 2024 15:10
dsx1986 pushed a commit to dsx1986/llama.cpp that referenced this pull request Oct 29, 2024
arthw pushed a commit to arthw/llama.cpp that referenced this pull request Nov 15, 2024
arthw pushed a commit to arthw/llama.cpp that referenced this pull request Nov 18, 2024
